Conpair: concordance and contamination estimator for matched tumor–normal pairs

نویسندگان

  • Ewa A. Bergmann
  • Bo-Juen Chen
  • Kanika Arora
  • Vladimir Vacic
  • Michael C. Zody
چکیده

MOTIVATION Sequencing of matched tumor and normal samples is the standard study design for reliable detection of somatic alterations. However, even very low levels of cross-sample contamination significantly impact calling of somatic mutations, because contaminant germline variants can be incorrectly interpreted as somatic. There are currently no sequence-only based methods that reliably estimate contamination levels in tumor samples, which frequently display copy number changes. As a solution, we developed Conpair, a tool for detection of sample swaps and cross-individual contamination in whole-genome and whole-exome tumor-normal sequencing experiments. RESULTS On a ladder of in silico contaminated samples, we demonstrated that Conpair reliably measures contamination levels as low as 0.1%, even in presence of copy number changes. We also estimated contamination levels in glioblastoma WGS and WXS tumor-normal datasets from TCGA and showed that they strongly correlate with tumor-normal concordance, as well as with the number of germline variants called as somatic by several widely-used somatic callers. AVAILABILITY AND IMPLEMENTATION The method is available at: https://github.com/nygenome/conpair CONTACT: [email protected] or [email protected] information: Supplementary data are available at Bioinformatics online.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On hazard ratio estimators by proportional hazards models in matched-pair cohort studies

BACKGROUND In matched-pair cohort studies with censored events, the hazard ratio (HR) may be of main interest. However, it is lesser known in epidemiologic literature that the partial maximum likelihood estimator of a common HR conditional on matched pairs is written in a simple form, namely, the ratio of the numbers of two pair-types. Moreover, because HR is a noncollapsible measure and its co...

متن کامل

A Two-Phase Robust Estimation of Process Dispersion Using M-estimator

Parameter estimation is the first step in constructing any control chart. Most estimators of mean and dispersion are sensitive to the presence of outliers. The data may be contaminated by outliers either locally or globally. The exciting robust estimators deal only with global contamination. In this paper a robust estimator for dispersion is proposed to reduce the effect of local contamination ...

متن کامل

Assessing Matched Normal and Tumor Pairs in Next-Generation Sequencing Studies

Next generation sequencing technology has revolutionized the study of cancers. Through matched normal-tumor pairs, it is now possible to identify genome-wide germline and somatic mutations. The generation and analysis of the data requires rigorous quality checks and filtering, and the current analytical pipeline is constantly undergoing improvements. We noted however that in analyzing matched p...

متن کامل

Epstein-Barr virus and familial Hodgkin's disease.

Several studies suggest that the Epstein-Barr virus (EBV) is etiologically linked to Hodgkin's disease (HD). This study was undertaken to examine the role of EBV in familial HD (FHD). Among 60 FHD patients from 27 families with two or more cases per family, we tested available paraffinized tumor tissues from 46 cases by in situ hybridization for EBV-encoded RNA (EBER1) expression. Thirteen of 4...

متن کامل

Estimating a Bounded Normal Mean Relative to Squared Error Loss Function

Let be a random sample from a normal distribution with unknown mean and known variance The usual estimator of the mean, i.e., sample mean is the maximum likelihood estimator which under squared error loss function is minimax and admissible estimator. In many practical situations, is known in advance to lie in an interval, say for some In this case, the maximum likelihood estimator...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 32  شماره 

صفحات  -

تاریخ انتشار 2016